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ABSTRACT 

We present an investigation into the effects of survey systematics such as vary¬ 
ing depth, point spread function (PSF) size, and extinction on the galaxy selection 
and correlation in photometric, multi-epoch, wide area surveys. We take the Canada- 
France-Hawaii Telescope Lensing Survey (CFFITLenS) as an example. Variations in 
galaxy selection due to systematics are found to cause density fluctuations of up to 
10% for some small fraction of the area for most galaxy redshift slices and as much as 
50% for some extreme cases of faint high-redshift samples. This results in correlations 
of galaxies against survey systematics of order ^1% when averaged over the survey 
area. We present an empirical method for mitigating these systematic correlations from 
measurements of angular correlation functions using weighted random points. These 
weighted random catalogues are estimated from the observed galaxy over densities by 
mapping these to survey parameters. We are able to model and mitigate the effect of 
systematic correlations allowing for non-linear dependencies of density on systematics. 
Applied to CFHTLenS we find that the method reduces spurious correlations in the 
data by a factor two for most galaxy samples and as much as an order of magnitude in 
others. Such a treatment is particularly important for an unbiased estimation of very 
small correlation signals, as e.g. from weak gravitational lensing magnification bias. 
We impose a criterion for using a galaxy sample in a magnification measurement of 
the majority of the systematic correlations show improvement and are less than 10% 
of the expected magnification signal when combined in the galaxy cross correlation. 
After correction the galaxy samples in CFFITLenS satisfy this criterion for Zphot < 0.9 
and will be used in a future analysis of magnification. 

Key words: galaxies: photometry, galaxies: statistics, gravitational lensing: weak, 
methods: data analysis 


1 INTRODUCTION 

Measurements of the angular correlation of galaxies are pow¬ 
erful tools for probing the properties of galaxies and cos¬ 
mology. 2-point correlation functions such as galaxy auto- 

* Based on observations obtained with MegaPrime/MEGACAM, 
a joint project of CFHT and CEA/DAPNIA, at the Canada- 
France-Hawaii Telescope (CFHT) which is operated by the Na¬ 
tional Research Council (NRC) of Canada, the Institut National 
des Sciences de I’Univers of the Centre National de la Recherche 
Scientifique (CNRS) of France, and the University of Hawaii. This 
work is based in part on data products produced at TERAPIX 
and the Canadian Astronomy Data Centre as part of the Canada- 
France-Hawaii Telescope Legacy Survey, a collaborative project 
of NRC and CNRS. 

t E-mail: cmorrison@astro.uni-bonn.de 


correlation functions, weak gravitational lensing shear and 
weak lensing magnification are the basic observables of many 
current and future photometric surveys. However, the sig¬ 
nals in these correlations can be contaminated by systematic 
errors arising from spatially inhomogeneous selections. Mea¬ 
surements of magnification bias especially suffer from these 
effects due to its small amplitude, even at small scales. While 
this paper takes systematics in magnification bias measure¬ 
ments as an example, the methods described herein will be 
useful for any small- to intermediate-scale, configuration- 
space correlation measurement. 

Magnification has shown utility in measuring both 
mass and cosmology. Many of these measurements have 
been of magnification bias, the density change of galax¬ 
ies due to lensing by interve n ing mass (iFord et al.l 20li 
|2014 iHildebrandt et ^ l2009l. I2OIII. EoTsT Morrison et al.l 
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l2012l : IScranton et alj [iooBI : iBauer et allboilf) . Other mea- dataset. Finally, in Sect. Q we summarize and give an out- 


surements have been also 

performed using magnitudes look to applications within CFHTLenS and beyond. 

and sizes dBauer et al.l 1201 1 

, bm 

1: iHuff & Graves! |2(Jll|: 

iMenard et al. 20101: Schmidt et al.l 

2 OI 2 I). These measure- 


ments have been shown to be compleme ntary to other 
prob e s of leasing of large-scale s tructure (Ivan Waerbekd 
|201(]I : iMenard fc BartelmanrJ I 2 OO 2 I I . Much of this comple¬ 
mentarity is from the magnification signal coming for free, 
that is the same survey design and data reduction for weak 
leasing shear surveys also allows for magnification. Indeed, 
weak leasing magnification does not rely on shape measure¬ 
ments meaning that it will not suffer from the same system¬ 
atic errors. This comes at the cost of greater sensitivity to 
photometry errors and selection effects induced by survey 
systematics such as depth and PSF size. While magnifica¬ 
tion shows large sensitivity to these selection effects they 
will also be present in other correlation analyses including 
weak leasing shear though as second-order effects. 

Much of magnification’s and other correlations’ mass 
and cosmological constraining power comes from measur¬ 
ing the signal at different radial distances and hence cos¬ 
mic time. Selecting galaxies at a given distance is commonly 
done through colour and magnitude cuts or more generally 
photometric redshifts (photo-z). These selections should be 
uniform across the survey to maximize their scientific utility. 
However, variations in selection efficiency across the survey 
can induce spurious systematic errors into the correlations. 
This selection function can be caused by variations in sur¬ 
vey depth, PSF full width half max (FWHM), extinction, 
and stellar density, for example. The different colour and 
photo- 2 : selections cause the inhomogeneities to be different 
for each galaxy sample considered. 

Current and future multi-epoch surveys will all suf¬ 
fer from such varying selection functions that will con¬ 
taminate correlation measures. By leveraging the statistical 
power of the Canada-France-Hawaii-Telescope Leasing Sur¬ 
vey (CFHTLenS) dataset, we investigate the effect of depth, 
PSF size, and extinction on galaxy density for several dif¬ 
ferent galaxy samples. We find that these systematics can 
have an influence the density, changing it by 10% in some 
small fraction of the survey area for most samples and up to 
50% for some faint high-z samples. We then build an empiri¬ 
cal, cosmology-independent method for reducing the effect of 
these systematics on correlation measurements. Techniques 
developed on CFHTLenS can be immediately used on other 
similar surveys such as the Red Sequence Cluster Lensing 
Survey (RCSLenSa-nd the Kilo Degree Survey (KiDS as 
well as any survey using a data pipeline and survey strategy 
similar to CFHTLenS. 

The paper is organized in the following way. The 
CFHTLenS data are presented in Sect. m Sect.l discusses 
angular systematics and how we relate those to the galaxy 
density. Results are presented in Sect. [T] where we show the 
galaxy densities as a function of the different systematics 
and the resulting spurious correlations. In section Sect.[5]we 
develop our method for modeling and removing angular sys¬ 
tematics with weighted random points. In Sect.|6] we show 
the results from applying this correction to the CFHTLenS 


2 DATA 


The CF HTLenS dat a are de scribed in iH eymans et al.l 


||201^, Erben et al.l (2013fl . I Miller et all ( 2013l l and 


iHildebrandt et al.l ( 2012 1. Here we will concentrate on the 


aspects of the data used in this analysis. CFHTLenS is a 
5 band [ugriz) imaging survey consisting of 171 pointings 
of ~ 1 sq. deg. each observed in 4 different regions on the 
sky. Within each pointings the data are observed using 4-7 
stepped dithers depending on the band pass. This observing 
strategy yields little overlap between pointings and CCDs. 
The unmasked area of the survey covers 117 sq. deg. This 
area represents the strictest masking available for the sur¬ 
vey. Reduced im ages and catalogues were produced using the 
THELI-pipeline (lErben et al.ll2005l f. Objects are detected in 
the i-band and the multi-colour photometry is corrected for 
different PSF sizes between bands by p erforming a PSF ho¬ 
mogenization (IHildebrandt et aDl2012fl . 

Galaxies in this analysis are selected in two broad sam¬ 
ples. The first, hereafter referred to as photo-z samples, are 
selected to have 19 < i < 24 magnitudes w ith photometric 
redshifts selected from BPZ (lBeriftejl2000l l best fit redshift, 
Z6. The other samples, hereafter re ferred to as LBGs, are se - 
lected using the dropout criteria of lHildebrandt et al.l (l2009fl 
and have magnitude cuts of 23 < r < 24 and 23.5 < i < 24.5 
for the u- and g-dropouts, respectively. These samples range 
from a total of ~1 million galaxies in some of the photo-z 
samples to ~30 thousand galaxies in the LBG samples. 

For this analysis we consider 11 different survey sys¬ 
tematics in total. We consider the limiting magnitude, also 
referred to as the depth, in each of the five observed bands 
in the survey. The depths are estimated from the inverse 
variance images created during the THELI stacking pro¬ 
cess. We also use the PSF size in each of the five bands. 
W e utilise the sizes a s mode led for the PSF homogenization 
of lHildebrandt et al.l (l2012l i. The stellar density in the sur¬ 
vey determines the fidelity of the PSF model. Small-scale 
variations can not be captured due to the limited amount 
of information and the smoothing due to the functional fit. 
However, we do not expect the size of the stacked PSF to 
vary too much at small scales. As the last survey system¬ 
atic we consider galactic extinction which we estimate from 
the E(B — F) valu es from the Schlegel-Finkbeiner-Davis 
dSchlegel et al.l[l9^ . SFD) dust map. 

To encode the survey geometry and and position depen¬ 
dence of the sy stematics we utilise t he spherical pixelisation 
code STOMP dScranton et al.l 120021 1^. This software allows 
us to pixelate the area of the survey and include complicated 
masking. We create constant resolution STOMP maps en¬ 
coding the number of galaxies as a function of position for 
a galaxy sample. The number counts of galaxies are sample 
at pixels of 16 sq. arcminutes and are summed or smoothed 
as need requires.. We also create constant resolution maps 
that encode variations in depth, PSF FWHM, and extinc¬ 
tion. In Fig. [T] we show the i-band depth and seeing for a 


^ http://www.rcslens.org 
^ http://kids.strw.leidenuniv.nl/ 


http: / / code.google.com/p/astro-stomp 
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Figure 1. Map of the i-band depth (left) and PSF size (right) for one CFHTLenS pointing (~ 1 sq. deg.). Each side of this plot is 1 
sq. deg. Values for the depth are relative to the mean of the field with the PSF size being fraction minus 1 relative to the mean. The 
depth shown is estimated from the inverse variance image produced during stacking. Red corresponds to deeper regions whereas green 
and blue correspond to shallower regions. The few blue pixels roughly one magnitude shallower than the red regions. The PSF size is 
estimated from the model used for the PSF homogenization of lHildebrandt et al.l ll2012h . The colour scale shows large PSF in red/green 
and small PSF in blue/purple. 


~ 1 sq. deg. pointing in CFHTLenS. One can clearly see 
the effect of the dither pattern in the depth map. This pat¬ 
tern is repeated in the other CFHTLenS bands which leads 
to a strong correlation of the depth in the different bands. 
Note that the shallowest pixels are up to one magnitude 
shallower than the mean of the pointing. There are also 
smaller scale structures from pixel to pixel sensitivity dif¬ 
ferences. These variations change the number density and 
properties of the detected galaxies in the i-band and also 
affect the colours of galaxies through the depth of the other 
four bands. The PSF size varies at much larger scales as 
shown in the right hand panel of Fig. [T] This affects both 
the detection efficiency (in the i-band) as well as the colour 
selection (in all bands) because the seeing influences the 
noise for small objects. Larger PSFs mean the detected sur¬ 
face brightness of galaxies decreases which affects errors and 
possibly also magnitude estimates through noise bias. Some 
of these colou r differences are correcte d in the PSF homog¬ 
enization of (iHildebrandt et n]|2012| ). however, the noise 
variations associated with the varying PSF will remain and 
affect selections. 

Fig.[2]shows the extinction for the same pointing seen in 
Fig. [T] The galactic dust causes a dimming and a reddening 
of galaxy colours. While CFHTLenS corrects for the redden¬ 
ing effect the dimming leads to a position dependent depth 
variation with the same consequences as the depth variation 
due to the dithering discussed above. The commonly used 
dust maps are also contaminated by light from high-redshift 
galaxies. This can lead to an unwanted physical correlation 
between the extinction correction and cosmic density. 

We also encode the d ensity of stars from two dif- 
ferent star s electi ons from IHildebrandt et ^ (|2012|) and 
iMiller et al.l (|2ni.‘lh . In the end, however, we do not utilize 
the stellar densities in this analysis due to difficulties with 
star-galaxy separation as described in Sect. 14.51 


AE(B - V) X 10 [MagAB] 



Figure 2. Map of E{B — V) extinction relative to the mean 
of the pointing for the same CFHTLenS pointing as shown in 
Fig.m Red represents regions of high extinction blue and purple 
regions of low extinction compared to the pointing mean. The 
values for the e xtinct ion are taken from the galactic dust maps of 
ISchlegel et alj lll998h over-sampled to the same resolution as the 
depth and seeing maps. 


3 GALAXY DENSITY ESTIMATES 

Angular systematics in clustering measurements can arise 
from a position-dependent selection function caused by vari¬ 
ations in instrumental and astrophysical quantities. The 
position-dependence gets imprinted on the data. Mathemat¬ 
ically this can be written as 

g ^ S{e)N{6) - {S{d)N{6)) 

{s{e)Nie)) 
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where 5{6) is the galaxy over-density at position 6, N is the 
true galaxy number count, and S is the sample-dependent 
selection. This factor can be significant, changing the den¬ 
sity by as much as ~ 50% for certain samples in CFHTLenS. 
In the remainder of the paper we distinguish between 5 
and S where the latter quantity is our estimate from the 
CFHTLenS data, i.e. including shot-noise from the finite 
density of the survey. CFHTLenS is split up into individual 
pointings of roughly 1 sq. deg. each. Within these pointings 
the data are observed using 4-7 stepped dithers depending 
on the band pass. This observing strategy yields little over¬ 
lap between pointings and CCDs as can be see in Fig. [T] 
In this figure, green and blue regions are chip gaps between 
CCDs which are significantly shallower than the mean of 
the survey by 0.2 magnitude s. Further informat ion on the 
dither pattern can be found in lErben et al.l ll2013tl . This lim¬ 
ited overlap is a feature shared with many other current and 
near future surveys. This can lead to difficulties in the pho¬ 
tometric calibration between pointings, inducing changes in 
galaxy densities related only to calibration issues. Analyses 
interested in scales larger than the pointing size need to be 
corrected for this effect. We instead remove variations that 
occur from pointing to pointing as we are interested in scales 
of the size of roughly a galaxy halo, much smaller than the 
CFHTLenS pointing size. Practically, we utilize the galaxy 
over-density relative to the mean of the pointing to compute 
our statistics instead of the global survey mean. We similarly 
do this with each systematic considered, i.e. we relate the 
depth/seeing/extinction in a particular pixel to the mean of 
the depth/seeing/extinction in the pointing that the pixel 
resides in. 

In Fig. [3] we compare the dependence of the measured 
over-density on i-band depth before and after going from 
absolute (i.e. relative to the survey mean) to relative (i.e. 
relative to the mean of individual pointings) values. Both 
the galaxy density and i-band depth are estimated on a 
scale of 8 arc-seconds (we use this smoothing for all sub¬ 
sequent density estimates). Note the large increase in signal 
to noise between the two panels. While we show this only for 
the i-band depth this is also true for the other systematic 
quantities. This is probably due to the data reduction being 
done pointing-wise and the catalogues bein g created from 
running SExtractor dBertin fc Arnout j[l99^ on the individ¬ 
ual pointings rather than running the detection over the full 
survey at once. We will utilize this relative scaling for all 
plots in the remainder of the paper. 


4 DEPENDENCE OF GALAXY DENSITY ON 
SYSTEMATICS 

First we compare the over-density of galaxies in each sam¬ 
ple against each of the systematics we consider separately. 
We bin the relative value of the systematic in ranges that 
represent equal area on the sky. We compute the values 
and errorbars by spatially jackknifing over the pointings of 
CFHTLenS (171 in total) contributing to each bin. This 
jackknifing is done with weighting by area, that is point¬ 
ings treated equally in the jackknife without regard for how 
much area they contribute to a particular systematic bin. 
The plots in the following sections show how the density 



Figure 3. Galaxy over-density <5 as a function of i-band depth for 
an i-band selected sample with 19 Si i < 24. The top panel shows 
the over-density with respect to the survey mean galaxy density 
against the limiting magnitude (per pixel). The bottom panel 
shows the same but using galaxy densities values relative to the 
mean in the individual pointings, averaged over the pointings as 
described in §|3] Each data point corresponds to a non-contiguous 
survey area of 1 sq. deg, that is from all pointings in the survey 
containing the systematic value range. The x-axis is plotted on 
a non-linear scale to allow for equal spacing between the data- 
points to show the variations in density more clearly around the 
mean value of depth. 

can depend on the different systematics even for relatively 
bright, low redshift galaxy samples. 

4.1 i-band Selected Sample 

Before presenting results using photo-z and colour-colour se¬ 
lections, we show the density variations of a purely i-band 
selected sample plotted against the i-band depth. The lower 
panel of Fig. |3] shows that the density as a function of rela¬ 
tive i-band depth fluctuates in a complex way. For the deeper 
part data (right half of the lower panel of Fig. [S)), we can 
understand this as the effect of Eddington bias leading to a 
monotonic decrease in density with increasing depth. This 
begins around the value of Ai > 0.05. In general, the Ed¬ 
dington bias in this case describes the asymmetric scatter of 
objects close to the noise limit. Since there are more faint 
than bright objects more objects scatter into the sample 
than out of the sample at the faint end. As the depth in¬ 
creases this effect becomes less important and density de¬ 
creases. This is a general trend seen in the other bands as 
well. We have no such explanation for the shallower por¬ 
tions of the survey (left half of the lower panel of Fig. [Sjl 
though the trends seen there are likely due to complexities 
in the stacking and detection pipeline. While it is tempting 
to claim that the i-band selected sample is completely in¬ 
dependent of effects from the other five bands due to it’s 
selection relying on i-band only, this is not the case here 
as the survey mask used in this analysis uses information 
from all the observed bands and could introduce a selec¬ 
tion dependent on the other bands. This can be seen in the 
next section where we plot these density fluctuations for the 
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Figure 5. Angular cross correlation functions between the observed galaxy density <5 and the survey depth Ssky in each band where 
purple= n, blue=g, green=r, yellow=i, red=z. We keep this convention throughout the paper. We plot all photo-z selected samples up 
to 0.9 < Zphot < 1-1 including the total sample of 0.1 < Zphot < 1-3 (top left). We neglect the 1.1 < Zphot < 1-3 for brevity though it is 
similar to the 0.9 < Zphot <1.1 sample. The vertical dot-dashed line corresponds to the width of a MegaCam Chip. 


0.1 < Zphoi < 1.3 



Figure 4. Plots of the galaxy over-density <5 against the rela¬ 
tive depth in the five CFHTLenS optical bands for the sample 
0.1 < Zphot < 1-3. The dot dashed line visible for large values of 
depth is the percentage of pointings (ranging from all 171 to 0) 
contributing to the density estimate of each depth bin. 


sample 0.1 < Zphot <1.3 which is very similar to the i-band 
selected sample. 

4.2 Galaxy Density vs. Depth 

We now show the plots of galaxy density against the depth in 
all five bands for samples of galaxies in CFHTLenS selected 
in photometric redshift bins. In Fig. [4] we show the density 
variations of the 0.1 < Zphot <1.3 sample against the depth. 
The dependence on i-band depth here is very similar to that 
of Fig. [3] showing an amplitude of up to ~10%. The other 
bands also show significant structure, however, some of this 
is expected to be redundant as the dither patterns are sim¬ 
ilar and therefore the depth varies in a similar way between 
the bands. Indeed upon cross-correlating the depth maps of 
different bands with each other we find that the amplitude 
is roughly 50% of the amplitude of auto-correlation of the 
individual band depth maps. For the low and high redshift 
galaxy samples that are a subsets of the sample shown in 
Fig. |4l the density variations in the u and z-bands become 
more apparent as these bands are important for distinguish¬ 
ing high and low redshifts. 

The cross-correlations of the galaxy positions for this 
sample against the depth maps are shown in the top-left 
panel of Fig. [5] Correlations are small here (< 0.1%) since no 
aggressive colour cuts are applied. The remaining panels of 
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Fig. [5] show the same correlations for narrow photo-z slices. 
Much larger amplitndes of up to ~ 0.3% occur for some 
samples indicating the importance of the colonr selection. 
Most of the correlations have an amplitnde of ~ 0.1% though 
and do not show significant structnre as a function of 9. For 
the depth, we have information down to the scale of roughly 
an arc-second and the auto-correlations of the depth maps 
do show power at these small sales. The galaxy samples show 
sensitivity to these small scales as some of the correlations 
(such as the 0.5 < iphot < 0.7 sample) are not leveling off as 
they would if there was no dependence. 

Two samples of note are the 0.3 < «phot < 0.5 sam¬ 
ple and the high redshift 0.9 < 2phot < 1-1 sample. The 
0.3 < 2phot < 0.5 sample shows significantly higher cross¬ 
correlations against the depth maps - by about a factor 
of two - compared to the other samples. One would ex¬ 
pect that this sample would be mostly free from systematics 
due to its low redshift and bright nature (i.e. similar to the 
0.1 < 2 < 0.3 sample). This is not the case as selecting this 
sample by photo -2 relies heavily on information from the u- 
band, which is shallower compared to the detection band, to 
distinguish it from the higher redshift selections. This is also 
true for higher redshifts such as 0.9 < 2phot <1.1, which use 
a lot of information from the u- and 2 -bands for the redshift 
determination. Coupled with the faintness of the sample the 
0.9 < 2phot < 1.1 sample exhibits a unique behavior in that 
the cross-correlation against the u-band depth map is sig¬ 
nificantly different from the cross-correlations against the 
other bands, with an opposite sign and a large amplitude. 

Fig- El shows the angular cross-correlation function be¬ 
tween the density of the two LEG samples and the depth in 
the different bands. These galaxy samples show significantly 
larger density variations (not shown), up to five times that of 
the brighter photo -2 selected samples. Likewise we observe 
a roughly 10 fold increase in the amplitude of the cross¬ 
correlations for these samples. This drastic increase is to be 
expected given the faint nature of the LEG samples and 
their sensitivity to changes in the depth. The correlations 
that result from this are largely monotonic in 6 with sig¬ 
nificant correlation at small and intermediate scales. These 
large correlations, in combination with the correlations from 
the other foreground samples, could lead to significant con¬ 
tamination of cross-correlation measurements between the 
different galaxy samples such as the ones used for magnifi¬ 
cation bias. 



Figure 6. Similar to Fig. [5] but for samples of Lyman-break 
Galaxies (LBGs). Note the different y-axis range. 
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Figure 7. Plots of the galaxy over-density 5 as a function of 
the PSF FWHM in the CFHTLenS optical bands for the sample 
0.1 < 2phot < 1-3. 


4.3 Galaxy Density vs. PSF Size 

In this section we analyze the dependence of the galaxy den¬ 
sity on the PSF size. As shown in Fig. Qthese density fluctu¬ 
ations are less pronounced compared to the trends seen with 
survey depth (Sect. 14.21) . They remain significant however 
with fluctuations of around 5% at the highest. The strongest 
trend in the density is found when looking at the i-band PSF 
size. This is likely due to the fact that the i-band is the detec¬ 
tion band. This also suggests that the PSF homogenization 
that correc ts for colour effects from different PSF sizes in 
each band ilHildebrandt et m]| 20121) is working properly. 

Correspondingly Fig. [8] shows the cross-correlation of 
the galaxy densities in the different samples against the PSF 
FWHM maps. The majority of the correlation seen here 
is flat at smaller scales (6 < larcmin). This is not unex¬ 


pected due to the fidelit y of the seeing model we use from 
ikildebrandt et ^ ([20121 1. We do not expect the PSF size 
to be much of an issue at the smallest scales as it tends to 
vary much more smoothly compared to the depth. The am¬ 
plitude we observe is as large as that of the depth and the 
correlation amplitude is almost always largest against the i- 
band PSF size map indicating that much of the correlation 
from the PSF is simply due to the detection process alone. 
Interestingly, the highest redshift slice appears to perform 
contrary to this trend with the correlation against the 2 - 
band PSF FWHM map being largest. This again suggests 
that the higher redshift samples are more sensitive to the 
2 -band through the redshift selection. 

The results are mixed for the LEGs as shown in Fig. [51 
The u-dropouts show large correlation amplitudes roughly 
equal in amplitude to the correlations against the depth 












































Mitigating Angular Systematics 7 



9 [arcmiii] 


6 [arcmin] 


9 [arcmin] 


Figure 8. This figure is similar to Fig. [S] except we now show the correlations against the PSF FWHM in each of the five bands. 


maps. However, the g-dropouts, which have the largest cross¬ 
correlation amplitudes to the depth overall, show very little 
correlation to the PSF size, with the amplitude being an or¬ 
der of magnitude lower that that of the u dropouts. We do 
not have an explanation for this surprising behaviour. 


4.4 Galaxy Density vs. Extinction 

Milky-Way dust can also introduce spurious correlations 
into galaxy samples. High redshift galaxies observed through 
galactic dust will have their colours reddened and thus their 
selection will change as the dust opacity changes. This ef¬ 
fect u sually is corrected us ing the Schlegel-Finkbeiner-Davis 
fSFD ISchlegel et al.lll99^ dust maps or similar data, mod¬ 
ifying the galaxy sample’s colours based on the amount of 
dust measured at a given location. However, these correc¬ 
tions cannot account for changes in the noise properties of 
the detected galaxies or - equally important - any galax¬ 
ies not detected due to the extra opacity. SFD and similar 
maps of dust opacity are also contaminated by high redshift 
galaxies as the infrared emission used to determine the dust 
opacity is contaminated by the emission from t he Cosmic 
Infrared Background fCIB. ISchmidt et al.l 120151 1. In addi¬ 
tion to this, we do not have access to data describing the 
small scale fluctuations of dust extinction. Indeed, the small 
scale str ucture of galactic extinction is an ongoing topic of 
research (IPlanck Collaboration et al.ll2014h . 

Fig. [To] shows the density of the 0.1 < 2phot < 1-3 sam- 



Figure 9. This figure is similar to Fig.[8]except we now show the 
cross-correlations of the densities of the two LBG samples against 
the PSF FWHM maps in the different bands. 

pie plotted against the amount of extinction relative to the 
mean of the pointing. The density variation shows a trend 
from small values to large with the density increasing with 
extinction. The reason is probably again Eddington bias as 
discussed in Sect. [4T] After correction for the reddening the 
main remaining effect of dust is that it changes the depth 
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Figure 11. This figure is similar to Fig. [Slexcep t we now show the co rrelations of galaxy density at different redshifts against the galactic 
extinction E(B — V) from the SFD dust maps llSchlegel et al.iri998h . 



Figure 10. Galaxy over-density 5 against the relative E{B — V) 
value from the SFD dust maps for the sample 0.1 < ^phot < l-S- 

of the data. This systematic mostly affects the lower red- 
shifts as their bluer colours make them more sensitive to 
dust absorption. 

The cross-correlations of the galaxy densities in differ¬ 
ent redshift slices against the dust map are shown in Fig. 1111 
Large amplitudes are found suggesting that the effect of dust 


changing the noise properties of galaxies is at least as im¬ 
portant as the effects from varying depth and seeing. Sur¬ 
prisingly, all samples show a significant anti-correlation at 
large scales (~ 50') on the order of ~ 0.05%. 

In Fig. [12] we show the correlations of the densities of 
the LEG samples against the extinction maps. Unlike for 
cross-correlation against the depth maps, where the LEG 
samples showed amplitudes larger by an order of magnitude 
compared to the photo -2 selected samples, we find that the 
amplitude of the cross-correlation of the extinction with the 
LEG densities is within a factor of 2-3 compared to the 
amplitude for the photo -2 selected samples. While the LEG 
samples have the reddest colours of any sample we consider, 
making them less sensitive to extinction, they rely on non¬ 
detections in the bluer filters for their selection making them 
sensitive to the extinction. 


4.5 Galaxy Density vs Stellar Density 

The density of stars can als o impact the sele ction function 
of galaxy samples (see e. g. I Ross et ^ I 2 OI 2 IL Two effects 
that can cause this are problems with star-galaxy separa¬ 
tion and stellar light halos obscuring galaxies and changing 
their colours. Poor star-galaxy separation causes stars to be 
miss-identified as galaxies and vice versa. The effect is to 
contaminate the galaxy correlation signal with the stellar 
correlation and suppress the amplitude. Galaxies observed 
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Figure 12. This figure is similar to Fig. [TT] except we show the 
correlation for the u- and g-dropout samples. 


through stellar halos in dense stellar fields also have their 
densities and correlations affected. 

We attempt to quantify the amount of correlation be¬ 
tween a stellar sample, galaxies in CFHTLenS, and the other 
systematics, however we find very inconsistent results based 
on how the stellar sample was selected either through colour 
I Hildebrandtet_3 I 2 OI 2 II or purely shape-based methods 
( Miller et al.l 201^ . These different stellar catalogues cor¬ 
relate very differently against a given galaxy sample for all 
magnitudes of the stellar sample. 

We employ an alternative method to test the effect of 
stellar halos by adding larger masks around each of stars 
from both stellar samples. The masks are chosen to reject 
areas 100 times the area of detected stellar objects. Little 
change is observed in the amplitude of the cross-correlations 
at all scales. The higher redshift samples show no change 
with the new more aggressive stellar masking. This and the 
observed flatness of the angular correlation function sug¬ 
gest that the cross-correlation of the stellar samples and the 
galaxy samples is dominated by stellar contamination in the 
galaxy sample rather than the previously mentioned halo 
effect. The stellar contamination is part of a larger issue of 
star-galaxy separation that we do not address in this paper. 

These inconclusive results are the reason why we do not 
consider stellar density as an additional systematic effect in 
our analysis in the following. More generally, any systematic 
that can not be mapped with sufficient accuracy is problem¬ 
atic and can not be analysed/removed with our approach. 
The whole procedure relies on detailed 2-dimensional maps 
of the systematics which are not readily available for the 
stellar density. 


5 MODELING AND REMOVING ANGULAR 
SYSTEMATICS 

Angular systematics from position-dependent selection func¬ 
tions can be a significant portion of the observed signal when 
performing a correlation analysis. In order to remove such 


spurious signals, we employ an empirical method of creating 
weighted random catalogues from the data. 

Spectroscopic surveys already employ a weighting of 
galaxies to corre ct for changes in sensitivity and selec- 
tion functio n (e.g . lNewman et al.ll2013l : lLe Fevre et al ]|2005l : 
IRoss et^hl l2012f) . These weightings are both analytically 
and empirically determined and enable the use of random 
galaxy samples that, when used in correlation estimators, 
remove much of the systematic variation from the dataset. 
This is not as commonly done with photometric surveys as 
selections are thought to be more homogeneous. As we have 
shown above this is not the always the case. 

Simulating the selection effects would be one way to 
account for this. However, selections that use photometric 
redshifts estimated from non-linear fitting are quite complex 
and difficult to simulate. We can, however, remove such sys¬ 
tematics from photometric samples using a fully empirical 
method. Empirical modeling of systematic effects in photo¬ 
metric correlations ha s been used in Fourier-space for large- 
scale BAO analyses (IHo et al.l I 2 OI 2 I : iLeistedt et al.l I 2 OI 3 I : 
iLeistedt fc Peiria l2014l f. Here we describe a method that 
works in configuration space and at small scales (~ larc- 
minute). We estimate weight maps for random points from 
the data itself using a mapping from measured instrumen- 
tal/astrophysical quantities to observed galaxy density. Here 
we consider the effects of depth in five bands, PSF size in 
five bands, and galactic extinction totaling in 11 survey sys¬ 
tematics for CFHTLenS. This method can only work if the 
quantities used are not intrinsically correlated with the ob¬ 
served samples, i.e. through a process that we would like to 
measure. As long as we limit ourselves to instrumental and 
galactic quantities, extragalactic measurements should not 
be affected. 


5.1 Weighted Random Points 


Weighted random points can be used in order to remove 
position dependent systematics induced by a varying se¬ 
lection function. This approach attempts to model and 
remove the systematics by mimicking the selection func¬ 
tion’s effect through random points that follow the under- 
and over-densiti e s. Th ese random points are used in the 
iLandv &l Szalavl lll993h correlation estimator which can be 
written as 


w{9) 


D 1 D 2 — D 1 R 2 — R 1 D 2 -l- R 1 R 2 
R 1 R 2 


( 2 ) 


where D 1 D 2 are the number of pairs as a function of angle 
between the two data samples in the cross-correlation, D 1 R 2 
and D 2 R 1 are the number of pairs between the data and 
samples of random points, and R 1 R 2 are pairs between the 
two random samples. If there is no position dependent selec¬ 
tion function the random catalogues are only required to fol¬ 
low the same geometry as the data catalogues. It should be 
noted that for auto-correlations 1 = 2 but we generalize for 
cross-correlations. This and similar estimators are designed 
to measure excess densities relative to a random distribu¬ 
tion of points. By having the random samples mimic the 
density variations due to the selection function, we can re¬ 
move these contributions from the correlation. In the above 
equation Ri would ideally have the same selection efficiency 
as Di and likewise for R 2 and D 2 . It is advantageous to 
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weigh the random samples instead of the data samples be¬ 
cause this approach decreases shot noise and maps the full 
selection function rather than just at points sampled by the 
galaxies. 

These random points are difficult to construct both an¬ 
alytically and empirically as they require a mapping of the 
values of the systematics to the expected under and over¬ 
densities. Analytically determining the mapping of system¬ 
atics to the induced selection requires an understanding of 
how the systematics affect the selection of the galaxy sam¬ 
ple considered. This is only possible for the simplest of se¬ 
lections and becomes impossible without large assumptions 
when the selection criteria are complex and the systemat¬ 
ics numerous. In principle one could attempt to simulate 
the data and selection process that results in the system¬ 
atics variations. However, this would require complex end- 
to-end simulations of the full galaxy sample, reduction- and 
detection-pipeline, multi-colour photometry, photo-z, etc. 

Determining the weighting empirically (or blindly) 
through mapping galaxy density to the set of systematics 
values is a more realistic option for complex selections. We 
can measure the value of each systematic at every position 
in the survey and map it to the observed density. This is 
the approach we take and it works well as long as there is 
a sufficient density of galaxies to map out the dependency. 
Once the galaxy sample drops below ~ 1 gal/arc-minute 
in CFHTLenS the fidelity of the map suffers. Another dif¬ 
ficulty is if our knowledge of the systematic does not have 
a fine enough resolution to probe the scales of interest or is 
contaminated (as in the case of the stellar density discussed 
above). In this paper, we find that with the majority of 
galaxy samples we are able to fit for much of the systematic 
variation. 


5.2 Data Preparation 

Creating a weighted masking for a photometric dataset such 
as CFHTLenS is complicated and requires careful consider¬ 
ation. This is mainly due to the large volume of data and 
the numerous possible systematics making determining the 
weighting challenging if the galaxy sample under consider¬ 
ation is sparse and the effect is small. The problem is to 
estimate the density of galaxies as a function of 11 variables 
making this a very high-dimensional problem. 

Throughout this analysis we utilize the spherical pix- 
elisation code STOM P and follow a methodology similar to 
IScranton et al.l (l2002l 'l to create the maps of systematics as a 
function of position and compute the correlations. We start 
by describing how the data is prepared, moving on to the 
smoothing of the data and then the creation of the weight 
maps. 

We utilize the STOMP maps created and described pre¬ 
viously in Sect. [2] to map the positional dependence of both 
the survey systematics and galaxy densities. Specific to our 
analysis of CFHTLenS, we use relative values of the density 
and systematics compared to the mean of each pointing in¬ 
stead of the global mean of the survey. This increases the 
signal to noise of our estimates in CFHTLenS (see Fig. 0). 
In order to properly estimate the density of galaxies against 
the systematics, we must find a way to bin the data such 
that the bins capture much of the variation in the system¬ 
atics but contain enough galaxies to properly estimate their 


density. This is a classical problem in high-dimensional data 
analysis and requires an algorithm to efficiently map the pa¬ 
rameter space and group data points. To do this we utilize 
a k-means clustering approach to identify portions of the 
data that have similar systematics values across pointings 
and estimate the galaxy density in these clusters. 


5.3 k-means Clustering 

Binning high-dimensional data efficiently is a non-trivial 
prob lem. We use a k-means clustering method llMacQueenI 
Il967ll in order to bin survey pixels with similar systematic 
values and compute an average of the sample galaxy density 
contained in those pixels. In general, this machine learning 
algorithm aims to locate portions of the data with equal vari¬ 
ance and bin them into clusters. The algorithm attempts to 
minimize 

k 

^ (3) 

i=l Xj^Si 


where fii is the ith cluster mean, the point Xj is a mem¬ 
ber of the set Si of all points nearest to fii, the first sum is 
carried out over all k clusters, and the second sum is car¬ 
ried out over all points in the ith cluster. Cluster centers 
are initially chosen at random. The center is then updated 
using the mean of the nearest data points. The process is re¬ 
peated until the difference in the sum above for new centers 
from one iteration to the next reaches a convergence crite¬ 
rion. We pick this method due to the ease of attributing 
new values to the already determined clusters and the sim¬ 
plicity of interpolating as a function of distance from clus¬ 
ter centers. We use the MiniBatchKMeans method from the 
scikit-learn Python packag^il designed for fast convergence 
on large datasets with a high dimensionality. 

As this technique is minimizing the variance of the clus¬ 
ters, we normalize the variance of each systematic considered 
to unity, that is, each value of the systematics is normalized 
by 


z{e) 


S0) - jsi) 

a{s{6) - {si)) 


(4) 


where z is the normalized value of the systematic at po¬ 
sition 0, s is the original value of the systematic, (si) is 
the average value of the systematic in pointing i containing 
6, and iy{s{9) — (si)) is the standard deviation of the sys¬ 
tematic minus the pointing mean over the full survey. The 
k-means algorithm then considers each dimension equally 
rather than over fitting certain variables due to their small 
variance. This allows the method to fit the systematics with¬ 
out any assumption about the galaxy sample’s sensitivity 
to a given systematic. On application to the CFHTLenS 
dataset the number of clusters we use varies with the density 
of the galaxy sample. We use three different numbers of clus¬ 
ters: 64, 128, 256, which are arbitrarily chosen but span the 
range of under and over fitting the variations in data when 
removing the systematics. We then calculate the average 
over-density and errors in each k-means cluster by spatially 


http://scikit-learn.org/ 
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jackknifing over each of the ^171 pointings of CFHTLenS 
contributing to the cluster. 


5.4 Testing 

We use the mapping of systematics to densities that comes 
out of the k-means clustering approach as a weighting from 
which to draw random samples. We can test the success of 
our weighting scheme by comparing the cross-correlations 
of the galcixy samples under consideration against each of 
the different systematics maps both before and after the 
weighting is applied. If the correction has worked, the cor¬ 
relation between the galaxy samples and the systematics 
maps should equal the correlation between the weight maps 
and the systematic maps within shot noise errors. This is 
a result independent of the science goal. The difference be¬ 
tween these two correlations yields an unbiased estimate of 
the amount of residual systematic error in the measurement 
after the weighting has been applied. It also gives an objec¬ 
tive criterion to discard samples that can not be properly 
corrected using this method as it shows they have more is¬ 
sues with possibly unknown systematics than can currently 
be accounted for. These tests are also independent of cos¬ 
mology or any assumptions pertaining to the galaxy sample 
as long as just instrumental and galactic systematics are 
considered. 


6 RESULTING CORRECTIONS 


Here we present results from modeling the relation of galaxy 
density against the 11 different systematics determined from 
our k-means method. We show how well the weights we 
create remove the systematic compared to the amount of 
systematic correlation we see before and after t he correc¬ 
tion is applied. We do this in a manner similar to lHo et al.l 
ll2012tl where the expected correlation amount for our cross¬ 
correlation is 


{5i52) aysij 


{5l5sysi){52SsySj) 

{^sysj^ ^sysj ) 


(5) 


where {SiS 2 )syaij is the contribution from systematics i, j 
contributing to the galaxy cross-correlation, is the over¬ 
density of galaxies for samples 1 and 2 respectively, and Sayai 
is the ith systematic considered. This equation is only truly 
valid for linear correlations and is likely not completely valid 
for the scales we consider. However, it does give us some 
indication of success. This is the best we can do currently 
without observing the galaxy cross-correlation before and 
after correction. This could add confirmation bias as we may 
stop once this galaxy cross-correlation looks "correct”. The 
plots shown in this section are only for the cases where i = j. 
The cross-terms are left out of the paper for brevity but are 
observed to be as improved as the diagonal terms. 

As an application of this method we compare the resid¬ 
ual systematic correlation signal to the expected signal from 
weak lensing magnification bias. We determine that we have 
successfully mitigated the systematic if the majority of the 
corrected correlations are consistent to within 10% of the 
expected magnification signal and that we see a reduction 
in amplitude for each correlation. It is clear that such a 
criterion does not directly guarantee that the correlation 


functions used in the science analysis are free from any sys¬ 
tematics. However, it is a necessary criterion to assure that 
none of the systematics investigated here swamp the signal. 

Fig. 1131 shows the cross-correlation amplitude between 
the lens galaxy samples and the u Dropouts sample caused 
by the systematics (unconnected coloured a;’s) and this same 
systematic correlation after the weighted correction is ap¬ 
plied. This difference is shown by the black data points con¬ 
nected by the dotted line. The dark shaded region in each 
plot represents our criterion of 10% of the magnification bias 
signal expected from a cross-correlation of the photo -2 sam¬ 
ple and a source sample at 2 = 3. For this signal we also 
have to assume some value for the slope of the source num¬ 
ber counts and here we choose this to be (a — 1) = 10 Addi¬ 
tionally we assume a galaxy bias of & = 1 for the foreground 
sample. 

For the contribution from the depth, the photo -2 se¬ 
lected samples with redshifts 2phot < 0.9 and the global 
redshift sample (0.1 < 2phot < 1.3) correlated against the 
u-dropout sample are mostly consistent with the 10% crite¬ 
rion. For higher redshifts this is not the case, especially in 
the n-band. The 0.3 < 2phot < 0.5 and 0.5 < 2phot < 0.7 
correlations show the most marked improvement, where the 
amplitudes before correction are the largest of all of photo -2 
samples and the amplitude after correction is reduced to a 
level consistent with zero at small scales (0 < 5 arc-minutes) 
for all but the 2 -band. In general, all the correlations are 
consistent with zero after correction at scales 0 < 10 arc- 
minutes. At large scales it is difficult to interpret the results 
due to the noise in these bins at the edge of the survey 
pointings. 

For the higher redshift slices, the improvement is not 
as dramatic. The 0.9 < 2phot < 1-1 sample shows that cor¬ 
relations vs. the survey depth are improved in all bands 
except for the u-band which retains a large coherent am¬ 
plitude at ~ 10 arc-mintues. Explaining this discrepancy is 
difficult considering the marked improvement in the other 
four bands. It is likely due to this sample being significantly 
fainter and sparser than the other samples considered. This 
both makes the systematics more severe due to the faintness 
and their effect harder to estimate due to the sparseness of 
the sample. 

The improvements seen for the low 2phot cross¬ 
correlations comes from both the foreground lens and source 
M-dropout LBGs. The LBGs show the largest correlations for 
all the systematics, especially the u dropout sample. These 
source samples show improvements by an order of magni¬ 
tude from what is observed in Figs. [S] HI and [T^ and are 
a large part of the improvement seen when combined with 
the foreground. The results shown here are similar for the 
p-dropouts which are left out of this section for brevity. 

Fig. [T?] shows the correlations and corrections against 
the seeing for the photo -2 selected and LBG samples. We 
utilise the same empirically estimated density map as in the 
previous figures which is simultaneously correcting for all 
the considered systematics. The systematics observed here 
are roughly on the same order as the depth systematics with 


® This is a conservative value for both drop out samples which 
are m easured to have a slightly higher value llHildebrandt et al.l 

[20^. 
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0.1 < Zp}iot 'C 1-3 0.1 < Zphot ^ 0.3 0.3 *c ^phot ^ 0.5 O.o <c Zp^Qi < 0.7 0.7 <c Zp^Qi < 0.9 0.9 ZpfiQf < 1.1 



Figure 13. Symmetric log plots of the observed and corrected depth systematics for foreground lens galaxies cross-correlated against the 
It' dropout sources. The y axis is plotted linearly for the range ilO”"*. The quantity plotted on is the estimated contribution from the 
depth to the magnification cross-correlation assuming linear systematics only. Unconnected a;’s show the raw systematic contamination. 
The black dashed line is the estimated correlation after the correction is applied again assuming linear systematics. Columns show these 
correlations for the different photo-z selected samples and the rows show the correlations against the different bands. The dark shaded 
region is ±10% of the expected magnification signal for the photo-z sample (assuming a galaxy bias of 6 = 1) lensing the u-dropout 
sample (assuming a slope of the number counts of (a — 1) = 1). 


more of the seeing systematic correlation amplitudes being 
outside of our criteria. After correction, the correlation am¬ 
plitudes show marked improvement with the z-band and i- 
band being consistent with zero. The other bands are slightly 
less successful but are made consistent with our criteria. 

Fig.[T5]shows the extinction map correlated against the 
different CFHTLenS galaxy samples and the corrected cor¬ 
relations. Modelling of this systematic is the least successful, 
though for the lower redshift (zphot <0.9) galaxies the am¬ 
plitude is reduced by an order of magnitude for most sam¬ 
ples. One possible reason for this is that the extinction is 
the only variable considered that is contaminated by signal 
from large-scale structure. The E{B — V) extinction maps 
are in part determined by infrared emission and partly con¬ 
taminated by e mission from the cosm ic infrared background 
as shown in e.g. ISchmidt et ^ (l2015l ). 

Overall, we find encouraging results in modeling and 
removing the largest systematics, reducing much of the spu¬ 
rious signal from the 11 systematics considered to consistent 
with zero for the expected contribution to the signal from 
systematics. This is largely true for the lower redshift bins, 
i.e. samples with 2phot < 0.9. For higher photo -2 slices, the 


corrections and modeling tend to break down. This is likely 
due to the increased sparseness and faintness of these sam¬ 
ples with the mean magnitude shifting by 1 magnitude in 
the most extreme case. These samples also have less that 
l/3rd of the galaxies as the number of lower redshift sam¬ 
ples. These faint samples are much more susceptible to the 
effect of systematics and it is likely there is not enough data 
to constrain these effects. We find more success with the 
Lyman-break galaxy samples likely due to the simple na¬ 
ture of the colour-colour selection. Large-scale corrections 
can be problematic for some samples though this is difficult 
to interpret due to noise. This is likely due to smoothing we 
perform, treating each pointing individually rather than the 
whole survey simultaneously. 


7 DISCUSSION AND CONCLUSIONS 

In this paper we have presented an investigation into sys¬ 
tematic density variations of galaxy samples in CFHTLenS 
caused by variations of instrumental and astrophysical quan¬ 
tities. We also present a model that attempts to mitigate 
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0.1 < Zp}iot 1-3 0.1 < ^phot ^ 0.3 0.3 *c ^phot ^ 0.5 O.o Zp^Qi < 0.7 0.7 <c Zp^Qi 0.9 0.9 ZpfiQf < 1.1 



Figure 14. Same as Fig. I13l but showing the correlations against the PSF FWHM maps instead of the survey depth. 


these effects, empirically modeling galaxy density as a func¬ 
tion of survey systematics. We model these survey sys¬ 
tematics with an eye toward measuring magnification in 
CFHTLenS and other similar multi-epoch, wide area sur¬ 
veys. We consider survey systematics of limiting magni¬ 
tude (depth), PSF size (FWHM), and galactic extinction 
{E{B — F)). In total for CFHTLenS this is 11 systematic 
variables: 5 depth, 5 PSF sizes, and 1 extinction. We find 
that, without a correction, correlations induced by variations 
in such systematics can be significant compared to the in¬ 
tended measurement. We find induced density fluctuations 
in small fractions of the area of the survey of up to 10% 
in photo -2 selected samples and up to even 50% in high -2 
Lyman-break galaxy samples. This can result in correlation 
amplitudes of 0.1% and 1.0%, respectively. In the case of 
magnification bias, this spurious correlation can be as large 
as twice the expected amplitude of the signal one wants to 
measure! 

We modeled these systematics by identifying regions of 
similar systematic values in the survey (in the 11 dimensions 
mentioned above) using a k-means clustering approach and 
estimating the average galaxy over-density as a function of 
the survey systematics. Using this method we see improve¬ 
ment in the spurious cross-correlation systematics for the 
majority of the galaxy samples considered. In some cases, 
the weighted correction reduces the expected contribution 
from systematics such that it is consistent with zero. The 


high redshifts, unfortunately, do not improve considerably 
and still show significant correlation of similar amplitude to 
the magnification. This is likely due to the large difference 
in brightness of the samples, with the higher redshift sam¬ 
ples being much fainter and sparser. These fainter samples 
are then more sensitive to selection effects and, due to these 
samples being sparser and noisier, we are unable to effec¬ 
tively estimate a correction with high precision. 

The LEG samples considered are an exception. While 
they are the faintest and sparsest samples that we are trying 
to correct, their corrections work surprisingly well. This is 
likely due to the fact that the LBGs are selected with simple, 
two-dimensional colour cuts. The high -2 photo -2 samples in 
contrast are selected in a much more complicated way by 
the inner workings of a photo -2 code {BPZ in our case) so 
that it is conceivable that their dependence on systematics 
is also more complicated. 

The analysis presented here directly enables measure¬ 
ments of magnification bias in the GFHTLenS. In the fu¬ 
ture, we will apply this technique to other surveys such 
as RGSLenS and KiDS. These surveys are very similar to 
GFHTLenS (especially in terms of data processing) and will 
enable quick turn around on the science. 

This technique will benefit from larger area, deeper fu¬ 
ture surveys, however, care will need to be taken to assure 
that computation time is tractable. The k-means algorithm 
used in this work is designed for large datasets and will 
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Figure 15. Similar to Fig. 1131 but for correlations against the E{B — V) map as derived from the SFD dust maos lISchlegel et al.lll99^ . 
For these plots, each separate panel is for a different galaxy sample. 


likely scale properly though further investigation is needed. 
In order to proceed with these larger datasets and properly 
remove systematics down to smaller scales, simulations of 
the dataset mimicking the observing strategy and large-scale 
structure will be required. This will give an understanding 
of both density dependence on survey systematics, but also 
systematics from changes in densities caused by deblending 
and light halos. This method could then be trained with 
simulated catalogs that mimic the systematics probed. This 
is left for future work. 

The fundamental limitation of this technique is the h- 
nite size of the galaxy sample that is used to map the de¬ 
pendency on the different systematics. While larger surveys 
will contain more information to establish this relation those 
surveys will also require systematic errors to be controlled 
to a higher level. It is not clear at the moment which of the 
two aspects will win in the end, i.e. whether our ability of 
modelling systematic effects will scale more favourably with 
increasing data volume than statistical noise or the opposite. 
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